AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Reinforcement Learning Inference Optimization

# Reinforcement Learning Inference Optimization

Acereason Nemotron 14B
Other
AceReason-Nemotron-14B is a math and code reasoning model trained through reinforcement learning, based on DeepSeek-R1-Distilled-Qwen-14B, excelling in math and code reasoning tasks.
Large Language Model Transformers
A
nvidia
7,863
70
Open RS1
MIT
A small-scale large language model enhanced by reinforcement learning, focused on improving the reasoning capabilities of a 1.5B parameter model
Large Language Model Transformers
O
knoveleng
6,229
4
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase